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FUNCTIONAL DECONVOLUTION IN A PERIODIC SETTING: 

UNIFORM CASE 

By Marianna Pensky 1 and Theofanis Sapatinas 
University of Central Florida and University of Cyprus 



We extend deconvolution in a periodic setting to deal with func- 
tional data. The resulting functional deconvolution model can be 
viewed as a generalization of a multitude of inverse problems in math- 
ematical physics where one needs to recover initial or boundary con- 
ditions on the basis of observations from a noisy solution of a partial 
differential equation. In the case when it is observed at a finite num- 
ber of distinct points, the proposed functional deconvolution model 
can also be viewed as a multichannel deconvolution model. 

We derive minimax lower bounds for the L 2 -risk in the proposed 
functional deconvolution model when /(•) is assumed to belong to 
a Besov ball and the blurring function is assumed to possess some 
smoothness properties, including both regular-smooth and super-smooth 
convolutions. Furthermore, we propose an adaptive wavelet estima- 
tor of /(■) that is asymptotically optimal (in the minimax sense), or 
near-optimal within a logarithmic factor, in a wide range of Besov 
balls. 

In addition, we consider a discretization of the proposed func- 
tional deconvolution model and investigate when the availability of 
continuous data gives advantages over observations at the asymptoti- 
cally large number of points. As an illustration, we discuss particular 
examples for both continuous and discrete settings. 



1. Introduction. We consider the estimation problem of the unknown 
response function /(•) based on observations from the following noisy con- 
volutions: 
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where U = [a,b], — oo < a < b < oo, and T = [0, 1]. Here, z(u,t) is assumed 
to be a two-dimensional Gaussian white noise, that is, a generalized two- 
dimensional Gaussian field with covariance function 

E[z(ui,ti)z(u 2 ,t 2 )} = S(ui - u 2 )S(tx - t 2 ), 

where 5(-) denotes the Dirac (^-function, cr(-) is assumed to be a known 
positive function, and 



with the blurring (or kernel) function G(-,-) in (1.2) also assumed to be 
known. Note that, since cr(-) is assumed to be known, both sides of (1.1) can 
be divided by a(-) leading to the equation 



where y (u, t) = Y(u, t) /a(u) and g(u, t — x) = G(u, t — x) /a(u) . Consequently, 
without loss of generality, we consider only the case when cr(-) = 1 and thus, 
in what follows, we work with observations from model (1.3). 

The model (1.3) can be viewed as a functional deconvolution model. If 
a = b, it reduces to the standard deconvolution model which attracted at- 
tention of a number of researchers. After a rather rapid progress in this 
problem in late 1980s to early 1990s, authors turned to wavelet solutions of 
the problem [see, e.g., Donoho (1995), Abramovich and Silverman (1998), 
Kalifa and Mallat (2003), Johnstone, Kerkyacharian, Picard and Raimondo 
(2004), Donoho and Raimondo (2004), Johnstone and Raimondo (2004), 
Neelamani, Choi and Baraniuk (2004) and Kerkyacharian, Picard and Rai- 
mondo (2007)]. The main effort was spent on producing adaptive wavelet 
estimators that are asymptotically optimal (in the minimax sense), or near- 
optimal within a logarithmic factor, in a wide range of Besov balls and under 
mild conditions on the blurring function. [For related results on the density 
deconvolution problem, we refer to, e.g., Pensky and Vidakovic (1999), Wal- 
ter and Shen (1999), Fan and Koo (2002).] 

On the other hand, the functional deconvolution model (1.3) can be 
viewed as a generalization of a multitude of inverse problems in mathe- 
matical physics where one needs to recover initial or boundary conditions 
on the basis of observations of a noisy solution of a partial differential equa- 
tion. Lattes and Lions (1967) initiated research in the problem of recover- 
ing the initial condition for parabolic equations based on observations in a 
fixed-time strip. This problem and the problem of recovering the boundary 
condition for elliptic equations based on observations in an internal domain 
were studied in Golubev and Khasminskii (1999); the latter problem was also 
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discussed in Golubev (2004). These and other specific models are discussed 
in Section 5. 

Consider now a discretization of the functional deconvolution model (1.3) 
when y(u,t) is observed at n = NM points (ui,ti), 1 = 1, 2,. ..,M, i = 
1,2,..., N, that is, 



where en are standard Gaussian random variables, independent for different 
/ and i. In this case, the functional deconvolution model (1.3) can also be 
viewed as a multichannel deconvolution problem considered in, for example, 
Casey and Walnut (1994) and De Canditiis and Pensky (2004, 2006); this 
model is also discussed in Section 5. 

Note that using the same n in (1.3) (continuous model) and (1.4) (dis- 
crete model) is not accidental. Under the assumptions (3.3) and (4.1), the 
optimal (in the minimax sense) convergence rates in the discrete model are 
determined by the total number of observations, n, and coincide with the 
optimal convergence rates in the continuous model. 

In this paper, we consider functional deconvolution in a periodic setting, 
that is, we assume that, for fixed u & U, /(•) and g(u, •) are periodic functions 
with period on the unit interval T. Note that the periodicity assumption 
appears naturally in the above mentioned special models which (1.3) and 
(1.4) generalize, and allows one to explore ideas considered in the above cited 
papers to the proposed functional deconvolution framework. Moreover, not 
only for theoretical reasons but also for practical convenience [see Johnstone, 
Kerkyacharian, Picard and Raimondo (2004), Sections 2.3, 3.1-3.2], we use 
band-limited wavelet bases, and in particular the periodized Meyer wavelet 
basis for which fast algorithms exist [see Kolaczyk (1994) and Donoho and 
Raimondo (2004)]. 

In what follows, we derive minimax lower bounds for the L 2 -risk in mod- 
els (1.3) and (1.4) when /(•) is assumed to belong to a Besov ball and g(-, •) 
is assumed to possess some smoothness properties, including both regular- 
smooth and super-smooth convolutions. Furthermore, we propose an adap- 
tive wavelet estimator of /(•) and show that this estimator is asymptotically 
optimal (in the minimax sense), or near-optimal within a logarithmic factor, 
in a wide range of Besov balls. We also compare models (1.3) and (1.4), and 
investigate when the availability of continuous data gives advantages over 
observations at the asymptotically large number of points. 

The paper is organized as follows. In Section 2, we describe the construc- 
tion of a wavelet estimator of /(•) for both the continuous model (1.3) and 
the discrete model (1.4). In Section 3, we derive minimax lower bounds for 
the L 2 -risk, based on observations from either the continuous model (1.3) 
or the discrete model (1.4), when /(•) is assumed to belong to a Besov 
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ball and <?(•,•) is assumed to possess some smoothness properties, includ- 
ing both regular-smooth and super-smooth convolutions. In Section 4, we 
demonstrate that the wavelet estimator derived in Section 2 is adaptive and 
asymptotically optimal (in the minimax sense), or near-optimal within a 
logarithmic factor, in a wide range of Besov balls. In Section 5, we discuss 
particular examples for both continuous and discrete settings. We conclude 
in Section 6 with a discussion on the interplay between continuous and dis- 
crete models. Finally, in Section 7, we provide some auxiliary statements as 
well as the proofs of the theoretical results obtained in the earlier sections. 

2. Construction of a wavelet estimator. Let </?*(•) and ip*(-) be the Meyer 
scaling and mother wavelet functions, respectively [see, e.g., Meyer (1992) 
or Mallat (1999)]. As usual, 

<p* jk (x) = 2^V(2^x - k), r jk (x) = 2'/V(2 J x - k), j, k G Z, 

are, respectively, the dilated and translated Meyer scaling and wavelet func- 
tions at resolution level j and scale position k/2 3 . (Here, and in what follows, 
Z refers to the set of integers.) Similarly to Section 2.3 in Johnstone, Kerky- 
acharian, Picard and Raimondo (2004), we obtain a periodized version of 
Meyer wavelet basis by periodizing the basis functions {p* (■) , ip* (■)} , that 
is, 

<p jh (x) =$>VV(2 J '(z + - k), ^ fc (x) =5>'/V(2 J (x + i) - k). 

In what follows, (•, •) denotes the inner product in the Hilbert space L 2 {T) 
(the space of squared-integrable functions defined on the unit interval T), 
that is, (f,g) = J T f{t)g(t) dt for /, g G L 2 {T). [Here, and in what follows, h(-) 
(resp. h) denotes the conjugate of the complex function h(-) (resp. complex 
number h); h(-) (resp. h) is real if and only if h(-) = h(-) (resp. h = h).] 

Let e m (t) = e l27rmt , m£Z, and, for any (primary resolution level) jo > 
and any j > j , let 

be the Fourier coefficients of (fjk(-), i^jki') and /(•), respectively. Denote 
(2.1) h(u,t)= f(x)g(u,t-x)dx, ueU,teT. 

JT 

For each u€U, denote the functional Fourier coefficients by 



h m (u) = (e m ,h(u, •)), 
g m {u) = (e m ,g(u,-)), 



y m (u) = (e rn ,y(u,-)), 
z m (u) = (e m ,z(u,-)). 
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If we have the continuous model (1.3), then, by using properties of the 
Fourier transform, for each u £ U, we have h m (u) = g m (u)f m and 

(2.2) Vm(u) =g m {u)f m + —=z m (u), 

In 



where z m (u) are generalized one-dimensional Gaussian processes such that 

(2.3) E[z mi (ui)z m2 (u 2 )} = u mi ,7712 

6(ui - u 2 ), 

where Sm l)m2 is Kronecker's delta. In order to find the functional Fourier 
coefficients f m of /(•), we multiply both sides of (2.2) by g m (u) and integrate 
over u£U. The latter yields the following estimators of f m : 

(2.4) f m =^J g m (u)y m {u) du\ J (J \g m (u)\ 2 du\ . 

[Here, we adopt the convention that when a = b the estimator f m takes the 
form f m = g m (a)y m {a) /\g m (a)\ 2 .] 

If we have the discrete model (1.4), then, by using properties of the dis- 
crete Fourier transform, for each I = 1, 2, ... , M, (2.2) takes the form 

(2.5) y m {ui) = g m (ui)f m + -^=z ml , 

V TV 



where z m i are standard Gaussian random variables, independent for different 
m and I. Similarly to the continuous case, we multiply both sides of (2.5) 
by gm(ui) and add them together to obtain the following estimators of f m : 

(2.6) f m = \^2g m (ui)ym(ui)j j ^ \g m (ui)rj ■ 

[Here, and in what follows, we abuse notation and f m refers to both func- 
tional Fourier coefficients and their discrete counterparts. Note also that 
ym{ui), g m (ui) and z m i are, respectively, the discrete versions of the func- 
tional Fourier coefficients y m (u), g m (u) and z m (u).] 

Note that, using the periodized Meyer wavelet basis described above and 
for any jo > 0, any (periodic) /(•) E L 2 (T) can be expanded as 

2^0-1 oo 2J-1 

(2.7) f(t) = ]T a jok <p jok (t) + E E b ^ jk (t). 

k=Q j=j k=0 

Furthermore, by Plancherel's formula, the scaling coefficients, a,j k = (/, <Pj k)i 
and the wavelet coefficients, bjk = (f,ipjk), of /(•) can be represented as 



(2.8) dj Q k = fmfmjok, bjk = ^2 fm,lp< 



mjki 



meC jo 
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where C jo = {m : tp m j k / 0} and, for any j > j , Cj = {m : ip mjk / 0}, both 
subsets of 2vr/3[-2 J+2 , -2 j ] U [2 j , 2 j+2 }, due to the fact that Meyer wavelets 
are band-limited [see, e.g., Johnstone, Kerkyacharian, Picard and Raimondo 
(2004), Section 3.1]. We naturally estimate dj k and bjk by substituting f m 
in (2.8) with (2.4) or (2.6), that is, 



(2.9) 0-j(,k — ^ ' fm^Pmjoki ^jk — ^ ' fm^mjk- 

mSCj m£Cj 

We now construct a block thresholding wavelet estimator of /(•)■ F° r this 
purpose, we divide the wavelet coefficients at each resolution level into blocks 
of length Inn. Let Aj and Uj r be the following sets of indices: 

Aj ={r\r= 1,2,..., 2 j /Inn}, 

U jr = {k | k = 0,1,.. .,2 j - l;(r - l)lnn < k <rlnn- 1}. 

Denote 

(2.10) B jr = £ b%, B jr = E b%- 
Finally, for any jo > 0, we reconstruct /(•) as 

2J0-1 J-l 

(2.11) f n (t)= E % fc^ofe(*) + E E E M(l%l>^)^(*)» 

k=0 j=jo r€Aj keUj r 

where 1(^4) is the indicator function of the set A, and the resolution levels 
jo and J and the thresholds Aj will be defined in Section 4. 

In what follows, we use the symbol C for a generic positive constant, 
independent of n, which may take different values at different places. 

3. Minimax lower bounds for the L 2 -risk over Besov balls. Among the 
various characterizations of Besov spaces for periodic functions defined on 
L P (T) in terms of wavelet bases, we recall that for an r-regular multireso- 
lution analysis with < s < r and for a Besov ball q (A) of radius A > 
with 1 < p, q < co, one has that, with s' = s + 1/2 — 1/p, 

/(•) G L P (T) : ( E KofcN 

(3.1) 

/ oo /2J-1 \ <j/P\ 1/9 "| 

+ E 2JA £W <4 

\j=j \k=0 / / J 

with respective sum(s) replaced by maximum if p = co or q = oo [see, e.g., 
Johnstone, Kerkyacharian, Picard and Raimondo (2004), Section 2.4]. (Note 
that, for the Meyer wavelet basis, considered in Section 2, r = co.) 
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We construct below minimax lower bounds for the L 2 -risk, for both the 
continuous model (1.3) and the discrete model (1.4). For this purpose, we 
define the minimax L 2 -risk over the set Q, as 

R n (Q) = mi su P E||/ n -/|| 2 , 

fn fen 

where \\g\\ is the L 2 -norm of a function g(-) and the infimum is taken over 
all possible estimators f n { ) (measurable functions taking their values in a 
set containing f2) of /(•), based on observations from either the continuous 
model (1.3) or the discrete model (1.4). [Here, and in what follows, the 
expectation is taken under the true /(•), and it is assumed that the function 
class Q contains /(•)■] 

In what follows, we shall evaluate a lower bound for R n (Bp q (A)). Denote 

s* = s + l/2-l/p', p' = min(p,2), 

and, for k = 1, 2, define 

rb 

\g m (u) | 2k du, in the continuous case, 

(3.2) T K (m) = { m 
~r? \ 9m(ui)\ 2K , in the discrete case. 

i=i 

[Here, we adopt the convention that when a = b, r K (m) takes the form 
T K( m ) = |fl , m(o)| 2fC ) K = l, 2.] Assume that for some constants v € R, a > 0, 
(3 > and K\ > 0, independent of m, the choice of M and the selection 
points Ui, I = 1, 2, ... , M, 

(3.3) n(m) < A'i|m|~ 2 ' / exp(-a|m| /3 ), v > if a = 0. 

[Following Fan (1991), we say that the function <?(•,•) is regular- smooth if 
a = and is super-smooth if a > 0.] 

The following statement provides the minimax lower bounds for the L 2 - 
risk. 

Theorem 1. Let {0j o ,fe(-)jV , i,fe(")} ^ e ^ e periodic Meyer wavelet basis 
discussed in Section 2. Let s > max(0, 1/p— 1/2), 1 < p < oo, 1 < q < oo and 
A > 0. Then, under the assumption (3.3), as oo, 

Cn -2s/{2s+2u+i)^ ifa = 0, u(2-p)<ps*, 

/In n \ 2s* /{2s*+2v) 

(3.4) R n {B s p ^A))>\ C(^J , i/a = 0,!/(2-p)>fw*, 

-2s*//3 



C(lnnT^ t/a>0. 



<s 
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Remark 1. The two different lower bounds for a = in (3.4) refer 
to the dense case [u{2 — p) < ps*} when the worst functions /(•) (i.e., the 
hardest functions to estimate) are spread uniformly over the unit interval 
T, and the sparse case [v(2 — p) > ps*] when the worst functions /(■) have 
only one nonvanishing wavelet coefficient. Note also that the restriction s > 
max(0, 1/p— 1/2), 1 <p < oo and 1 < q < oo ensures that the corresponding 
Besov spaces are embedded in L 2 (T). 

4. Minimax upper bounds for the Z 2 -risk over Besov balls. Recall (3.3) 
from Section 3, and assume further that for the constants v G M, a > and 
(3 > 0, and for a constant K<i > 0, independent of m, the choice of M and 
the selection points ui, I = 1,2, ... , M, with K2 < K\, 

(4.1) Ti(m)>K 2 \m\^ 2u exp(-a\m\ (3 ), v > if a = 0. 

For any j > jo, let \Cj\ be the cardinality of the set C,-; note that, for 
Meyer wavelets, \Cj\ = ^tt2 3 [see, e.g., Johnstone, Kerkyacharian, Picard 
and Raimondo (2004), page 565]. Let also 

(4-2) A ^=\h £ T »H[lH]"* K = 1 ' 2 - 

Then, direct calculations yield that 

f ci2 2uj if a = 

(4 ' 3 ) A ' 0)£ { C2 2^'e Xp ( a (f)V), ifoo' 

[Note that since the functional Fourier coefficients g m (•) are known, the 
positive constants c\ and C2 in (4.3) can be evaluated explicitly] 

Consider now the two cases a = (regular-smooth) and a > (super- 
smooth) separately. Choose jo and J such that 

(4.4) 2 jo =\nn, 2 J =n 1/ ^ +1) if a = 0. 

(4.5) 2*> = — f— V //3 , 2 J = 2*\ ifa>0. 
V ; 8vr V 2a J ' 

[Since jo > <^ — 1 when a > 0, the wavelet estimator (2.11) only consists of 
the first (linear) part and, hence, Xj does not need to be selected in this 
case.] Set, for some positive constant d, 

(4.6) Xj = dn~ l 2 2vj \nn if a = 0. 

Note that the choices of jo, J and Xj are independent of the parameters, s, 
p, q and A (that are usually unknown in practical situations) of the Besov 
ball Bp q (A); hence, the wavelet estimator (2.11) is adaptive with respect to 
these parameters. 

The proof of the minimax upper bounds for the L 2 -risk is based on the 
following two lemmas. 



FUNCTIONAL DECONVOLUTION 9 

Lemma 1. Let the assumption (4-1) be valid, and let the estimators cij ok 
and bjk of the scaling and wavelet coefficients a,j ok and bj k , respectively, be 
given by the formula (2.9) with f m defined by (2.4) in the continuous model 
and by (2.6) in the discrete model. Then, for k = 1,2, and for all j > jo, 

(4.7) E\a jok - a jok \ 2 < Cn^A^jo), 

(4.8) E\b jk -b jk \ 2K <Cn- K A K (j). 

Moreover, under the assumptions (3.3) and (4-1) with a = 0, for all j > jo, 

(4.9) A 2 (j) <C2^ 2u -^\ 
for any < v\ < v. 

Lemma 2. Let the estimators bj k of the wavelet coefficients bj k be given 
by the formula (2.9) with f m defined by (2.4) in the continuous model and 
by (2.6) in the discrete model. If fi is a positive constant large enough and 
a = in the assumption (4-1); then, for all j > jo, 



(4.10) 



|&jfc-^fe| 2 >0.25/^ 2 n _1 2 2l/j lnn) < 



3i/-4i/!+2)/(2i/+l) 



\k&Ur 



for any < v\ < v . 



Lemmas 1 and 2 allow to state the following minimax upper bounds for 
the L 2 -risk of the wavelet estimator / n (-) defined by (2.11), with jo and J 
given by (4.4) (if a = 0) or (4.5) (if a > 0). Set (x) + = max(0,x), and define 

( (2u + l){2-p) + 



(4.11) 



Qx 



p{2s + 2v + l) 

(q-p)+ 



lo, 



if v(2 -p) <ps*, 

if v(2 -p) =ps*, 
if v(2 — p) > ps*. 



Theorem 2. Let f n (-) be the wavelet estimator defined by (2.11), with jo 
and J given by (4-4) (tf ce = 0) or (4-5) (if a> 0). Let s > 1/p' , 1 < p < oo, 
1 < q < oo and A > 0. Then, under the assumption (4-1), as n — > oo, 

'Cn- 2 »/(2»+»'+i)(l I m)w, 

if a = 0,z/(2 -p) <ps*, 

/lnn\ 2s * /(2s * +2v) 
(4.12) sup E||/ n -/|| 2 <<| C {— ) O 11 ^ 1 ' 

f^M) ifa = 0,v(2-p)>ps*, 

C(lnn)- 2s */' 3 1 
if a > 0. 
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Remark 2. In the discrete model, assumptions (3.3) and (4.1) require 
the value of Ti(m) to be independent of the choice of M and the selection 
of points ui, I = 1, 2, . . . , M. If assumptions (3.3) and (4.1) hold, then the 
minimax convergence rates in discrete and continuous models coincide and 
are independent of the configuration of the points ui, I = 1,2, ... , M. More- 
over, the wavelet estimator (2.11) is asymptotically optimal (in the minimax 
sense) no matter what the value of M is. It is quite possible, however, that 
in the discrete model, conditions (3.3) and (4.1) both hold but with differ- 
ent values of v, a and [3. In this case, the upper bounds for the risk in the 
discrete model may not coincide with the lower bounds and with the mini- 
max convergence rates in the continuous model. Proposition 1 in Section 6 
provides sufficient conditions for the minimax convergence rates in discrete 
and continuous models to coincide and to be independent of M and the 
configuration of the points ui, 1 = 1,2, ...,M. These conditions also guar- 
antee asymptotical optimality of the wavelet estimator (2.11), and can be 
viewed as some kind of uniformity conditions. If conditions of Proposition 
1 are violated, then the rates of convergence in the discrete model depend 
on the choice of M and ui, I = 1,2, . . . , M, and some recommendations on 
their selection should be given. Furthermore, optimality issues become much 
more complex when T\(m) is not uniformly bounded from above and below 
(see the discussion in Section 6). 

Remark 3. Theorems 1 and 2 imply that, for the L 2 -risk, the wavelet 
estimator / n (-) defined by (2.11) is asymptotically optimal (in the minimax 
sense), or near-optimal within a logarithmic factor, over a wide range of 
Besov balls Bp (A) of radius A > with s > max(l/p, 1/2), 1 < p < oo and 
1 < q < oo. In particular, in the cases when (1) a > 0, (2) a = 0, v(2 — p) < 
ps* and 2 < p < oo, (3) a = 0, u(2 - p) > ps*, and (4) a = 0, u(2 - p) =ps* 
and 1 < q < p, the estimator (2.11) is asymptotically optimal (lower and 
upper bounds coincide up to a multiplicative constant), that is, 



Rn(B a p>q (A)) x 



' n -2s/(2s+2u+l) 
/hm\ 2s */( 2s *+ 2 ^ 



if a = 0, u(2- 

if a = 0, i/(2- 

or a = 0, v(2 
if a > 0. 



-p) <ps*, 2 <p< oo, 
-p) >ps*, 
— p) =ps*, l<q<p, 



. (Inn)- 2 **//?, 



On the other hand, in the case when a = 0, z^(2 — p) < ps* and 1 < p < 2 or 
a = 0, u{2 — p) = ps* and 1 < p < q, the wavelet estimator / n (-) defined by 
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(2.11) is asymptotically near-optimal within a logarithmic factor, that is, 

' £ n -2s/(2s+2^+l)( lnn )(2^+l)(2-p)/(p(2 S +2^+l)) ^ 



SUp nfn-ff<{ 
/6B' i9 (A) 



if a = 0, i/(2— p) <ps*, 
1 <P< 2, 

/l nn \ 2s*/(2s*+2^) 



(l nn )(l-p/9) 5 

\ n J 

if a = 0, i/(2 -p) = ps*, 

1 <P < q- 



[Here, and in what follows, gi(n) x 32 (^) denotes < liminf(gi(ra)/(/2(fi)) < 
limsup(<7i(n)/<72(^)) < °o as n — > 00.] 

Remark 4. For the L 2 -risk, the upper bounds (4.12) are tighter than 
those obtained by Chesneau (2008) for the regular-smooth case [i.e., a = in 
(3.3) and (4.1)] in the case of the standard deconvolution model [i.e., when 
a = b in (1.3)], although the difference is only in the logarithmic factors. 
More specifically, the following minimax upper bounds obtained in Chesneau 
(2008) for the L 2 -risk, as n — >■ 00: 



(4.13) 



where 



(4.14) 



sup E| 



/II 2 < 



Cn -2s/(2s+2u+l) ( i nn y 2 ^ 

if a = 0, i/(2 — p) < ps', 

C( I (lnn)^, 

0,i/(2-p)>ps', 



n 
if a 



^2 



2sl(l <p< 2) 
2s + 2i/ + 1 

(2g-p)+ 



0. 



if z/(2 -p) <ps', 



if i/(2 
if i/(2 



■p) =ps', 
p) > ps'. 



[Here, and in what follows, 1(A) is the indicator function of the set A.] Note 
that when 2<p<oo, s* = s < s' , and only the dense case appears; hence, 
in this case, the dense cases and the corresponding convergence rates in the 
minimax upper bounds given by (4.11)-(4.12) and (4.13)-(4.14) coincide 
since i/(2 — p) < ps* =ps < ps'. On the other hand, when 1 < p < 2, s* = s' , 
both the dense and sparse cases appear; hence, in this case, both the dense 
and sparse cases and the corresponding convergence rates in the minimax 
upper bounds given by (4.11)-(4.12) and (4.13)-(4.14) coincide. Looking 
now at (4.11) and (4.14), we see that Q2 = Q\ only when z/(2 — p) > ps' . 
On the other hand, Q2 > Q\ when 1 < p < 2 and v(2 — p) < ps' since (2/p — 
l)(2i/ + 1) — 2s = 2(2v — pv — ps') jp < 0, and it is obvious that Q2 > Q\ when 
z/(2— p) =ps' . However, we believe that the slight superiority in the minimax 
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convergence rates for the L 2 -risk obtained in Theorems 1 and 2 is due not to 
a different construction of the wavelet estimator but to a somewhat different 
way of evaluating the minimax upper bounds. 

Remark 5. Unlike Chesneau (2008) who only considered minimax up- 
per bounds for the regular-smooth case [i.e., a = in (3.3) and (4.1)] in the 
standard deconvolution model [i.e., when a = b in (1.3)], Theorems 1 and 2 
provide minimax lower and upper bounds (in the L 2 -risk) for both regular- 
smooth and super-smooth convolutions [i.e., a > in (3.3) and (4.1)], not 
only for the standard deconvolution model but also for its discrete counter- 
part [i.e., when M = 1 in (1.4)]. 

Remark 6. The wavelet estimator f n (-) defined by (2.11) is adaptive 
with respect to the unknown parameters s, p, q and A of the Besov ball 
Bj hq (A) but is not adaptive with respect to the parameters a, j3 and v in 
(3.3) and (4.1). It seems that it is impossible to achieve adaptivity with 
respect to f3 in the super-smooth case (a > 0) because of the very fast expo- 
nential growth of the variance. However, in the regular-smooth case (a = 0), 
one can construct a wavelet estimator which is adaptive with respect to the 
unknown parameter v. Choose jo and J such that 2- 70 = Inn and 2 J = n, 
and set Xj = d*n~ 1 lnnAi(j), where d* is large enough. Note that Ai(j) can 
be calculated whenever the functional Fourier coefficients g m {-) are avail- 
able. Also, K*2 2v i < A\(j) < if|2 2 ^ for some positive constants K{ and 
K2 which depend on the particular values of the constants in the condi- 
tions (3.3) and (4.1). Therefore, in this situation, by repeating the proof of 
Theorem 2 with these new values of the parameters involved, one can easily 
verify that the optimal convergence rates in Theorem 2 still hold as long as 
d* is large enough. How large should be "large enough" ? Direct calculations 
show that d* should be such that (0.5d* Jk^/K\ - l) 2 > Su + 2. Since Kf, 
K2 and v are unknown, it is impossible to evaluate the lower bound for d* . 
However, one can replace d* by a slow-growing function of n, say In Inn, 
leading to, at most, an extra In Inn factor in the obtained maximal L 2 -risk. 

Remark 7. We finally note that, although we have only considered 
L 2 -risks in our analysis, the results obtained in Theorems 1 and 2 can be 
extended to the case of L^-risks (1 < n < 00). Analogous statements to the 
ones given in Theorems 1 and 2 but for a wider variety of risk functions can 
be obtained using the unconditionality and Temlyakov properties of Meyer 
wavelets [see, e.g., Johnstone, Kerkyacharian, Picard and Raimondo (2004), 
Appendices A and B]. The details in the derivation of these statements 
should, however, be carefully addressed. 
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5. Examples in continuous and discrete models. The functional decon- 
volution model (1.3) can be viewed as a generalization of a multitude of 
inverse problems in mathematical physics where one needs to recover initial 
or boundary conditions on the basis of observations of a noisy solution of 
a partial differential equation. Lattes and Lions (1967) initiated research in 
the problem of recovering the initial condition for parabolic equations based 
on observations in a fixed-time strip. This problem and the problem of recov- 
ering the boundary condition for elliptic equations based on observations in 
an internal domain were studied in Golubev and Khasminskii (1999). More 
specifically, by studying separately the heat conductivity equation or the 
Laplace equation on the unit circle, and assuming that the unknown initial 
or boundary condition belongs to a Sobolev ball, Golubev and Khasminskii 
(1999) obtained some linear and nonadaptive solutions to the particular 
problem at hand; see also Golubev (2004) for a linear adaptive estimator for 
the Laplace equation on the circle based on the principle of minimization of 
penalized empirical risk. We also note that, unlike Golubev and Khasminskii 
(1999) and Golubev (2004) who considered sharp asymptotics, we focus our 
study on rate optimality results. [Note that the estimation of the unknown 
initial condition for the heat conductivity equation, allowing also for miss- 
ing data, has been recently considered by Hesse (2007); however, this latter 
paper deals with the density deconvolution model and the approach given 
therein varies from the approach of Golubev and Khasminskii (1999) and 
Golubev (2004), and it seems to be having a different agenda.] 

In view of the general framework developed in this paper, however, the 
inverse problems mentioned above can all be expressed as a functional de- 
convolution problem, so that all techniques studied in Sections 2-4 can be 
directly applied, to obtain linear / nonlinear and adaptive solutions over a 
wide range of Besov balls. Such solutions are provided in Examples 1-4 below 
which discuss some of the most common inverse problems in mathematical 
physics which have already been studied as well as some other problems 
which, to the best of our knowledge, have not yet been addressed. 

On the other hand, in the case when the functional deconvolution model 
(1.3) is observed at a finite number of distinct points [see (1.4)], it can also be 
viewed as a multichannel deconvolution model studied in De Canditiis and 
Pensky (2004, 2006). Example 5 below deals with this model, providing the 
minimax convergence rates (in the L 2 -risk) for regular-smooth [i.e., a = in 
(3.3) and (4.1)] and super-smooth [i.e., a > in (3.3) and (4.1)] convolutions, 
and also discussing the case when M can increase together with N; both 
of these aspects were lacking from the theoretical analysis described in De 
Canditiis and Pensky (2006). 
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Example 1 (Estimation of the initial condition in the heat conductivity 
equation). Let h(t,x) be a solution of the heat conductivity equation 

dh(t,x) d 2 h(t,x) r „ „, r „ , 

m = dx 2 > SG[0,1], t€[a,6], a>0, 6<oo, 

with initial condition h(0, x) = f(x) and periodic boundary conditions 

dh(t,x) 



h(t,0)=h(t,l), 



dx 



x=0 



dx 



2 = 1 



We assume that a noisy solution y(t, x) = h(t, x) + n~ 1 / 2 z(t, x) is observed, 
where z(t, x) is a generalized two-dimensional Gaussian field with covariance 
function E[z(ii, x±)z(t2, ^2)] = $(ti — t2)5(x\ —X2), and the goal is to recover 
the initial condition /(•) on the basis of observations y(t,x). This problem 
was considered by Lattes and Lions (1967) and Golubev and Khasminskii 
(1999). 

It is well known [see, e.g., Strauss (1992), page 48] that, in a periodic 
setting, the solution h(t,x) can be written as 

\2 



(5.1) h(t,x) = (Antr^ [ 1 J2™A JX + l ^ ) 



f(z)dz. 

It is easy to see that (5.1) coincides with (2.1) with t and x replaced by u 
and t, respectively, and that 

[t + kf 



< 7 (n,t) = (47ra)- 1 / 2 ^exp[- 



4(t 



fcez 

Applying the theory developed in Sections 2-4, we obtain functional Fourier 
coefficients g m (-) satisfying g m (u) = exp(— 4ir 2 m 2 u), and 

ri(m) = / \g m (u)\ 2 du = Cm~ 2 exp(—8TT 2 m 2 a)(l + o(l)), \m\ — > 00, 



so that v = 1, a = 87r 2 a and j3 = 2 in both (3.3) and (4.1). 

Hence, one can construct an adaptive wavelet estimator of the form (2.11), 
with jo and J given by (4.5), which achieves minimax (in the L 2 -risk) con- 
vergence rate of order (lnn) _s over Besov balls BiJA) of radius A > 
with s > max(l/p, 1/2), 1 < p < 00 and 1 < q < 00. 

Example 2 (Estimation of the boundary condition for the Dirichlet prob- 
lem of the Laplacian on the unit circle). Let h(x,w) be a solution of the 
Dirichlet problem of the Laplacian on a region D on the plane 

d 2 h(x,w) d 2 h(x,w) . „ o 

(5.2) A-j ' + d { w2 ' =0, (x,w)£DCR 2 , 
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with a boundary dD and boundary condition 
(5.3) h(x,w)\ dD = F(x,w). 

Consider the situation when D is the unit circle. Then, it is advantageous 
to rewrite the function h(-, •) in polar coordinates as h(x,w) = h(u,t), where 
u € [0, 1] is the polar radius and t € [0, 2ir] is the polar angle. Then, the 
boundary condition in (5.3) can be presented as h(l,t) = f(t), and h(u,-) 
and /(•) are periodic functions of t with period 2ir. 

Suppose that only a noisy version y(u,t) = h(u,t) + n -1 ' 2 ^(it, t) is ob- 
served, where z(u,t) is as in Example 1, and that observations are available 
only on the interior of the unit circle with u G [0, ro], ro < 1, that is, a = 0, 
b = tq < 1. The goal is to recover the boundary condition /(•) on the ba- 
sis of observations y(u,t). This problem was investigated in Golubev and 
Khasminskii (1999) and Golubev (2004). 

It is well known [see, e.g., Strauss (1992), page 161] that the solution 
h(u,t) can be written as 

L/ , (1-n 2 ) f2« f(x) 

h(u,t) = - - / J -j-i- — r ^dx. 

2ir Jo 1 — 2ucos(t — x) + u A 

Applying the theory developed in Sections 2-4 with e m (t) = e %mt and 

1 -u 2 

9(u,t) = -— ^, 

1 — 2ucos{t) + u z 

we obtain functional Fourier coefficients g m (-) satisfying g m (u) = Cu m , and 
n(m)=/ \g m (u)\ du = Cexp{-21n(l/r )|m|}, 



so that v = 0, a = 21n(l/r ) and /3 = 1 in both (3.3) and (4.1). 

Hence, one can construct an adaptive wavelet estimator of the form (2.11), 
with jo and J given by (4.5), which achieves minimax (in the L 2 -risk) con- 
vergence rate of order (lnn)~ 2s over Besov balls A A) of radius A > 
with s > max(l/p, 1/2), 1 < p < oo and 1 < q < oo. 



Example 3 (Estimation of the boundary condition for the Dirichlet prob- 
lem of the Laplacian on a rectangle). Consider the problem (5.2)— (5.3) 
in Example 2 above, with the region D being now a rectangle, that is, 
(x, w) E [0, 1] x [a, b] , a > 0, b < oo, and periodic boundary conditions 

h(x,0) = f(x), h(0,w)=h(l,w). 

Again, suppose that only a noisy version y(x,w) = h(x,w) + n -1 / 2 z(x,w) is 
observed, where z(x,w) is as in Example 1, for xE [0, 1], w € [a, b], and the 
goal is to recover the boundary condition /(•) on the basis of observations 
y{x,w). 
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It is well known [see, e.g., Strauss (1992), pages 188, 407] that, in a peri- 
odic setting, the solution h(x,w) can be written as 

(5 - 4) h{x > w) = Si S ^ + {x W +k - z y f{z) dz - 

It is easy to see that (5.4) coincides with (2.1) with x and w replaced by t 
and u, respectively, and that 



s( "' ,)= '"S^Fi) ! ' 



Applying the theory developed in Sections 2-4, we obtain functional Fourier 
coefficients g m (-) satisfying g m (u) = exp(— 2-nmu), and 

ri(m) = / \g m (u)\ 2 du = C\m\~ l exp(— 47r|m|a)(l + o(l)), |m| 



oo, 

so that v = 1/2, a = 4ira and (5 = 1 in both (3.3) and (4.1). 

Hence, one can construct an adaptive wavelet estimator of the form (2.11), 
with jo and J given by (4.5), which achieves minimax (in the L 2 -risk) con- 
vergence rate of order (lnn)~ 2s over Besov balls BiJA) of radius A > 
with s > max(l/p, 1/2), 1 < p < oo and 1 < q < oo. 

Example 4 (Estimation of the speed of a wave on a finite interval). Let 
h(t,x) be a solution of the wave equation 

d 2 h(t,x) _ d 2 h(t,x) 
dt 2 dx 2 
with initial-boundary conditions 

dh(t,x 



h(0,x) = 0, 



dt 



= f(x), h(t,0)=h(t,l) = 0. 

t=0 



Here, /(•) is a function defined on the unit interval [0, 1], and the objective 
is to recover /(•) on the basis of observing a noisy solution y(t, x) = h(t, x) + 
n~ 1 / 2 z(f, x), where z(t,x) is as in Example 1, with i € [a, b], a > 0, b< 1. 

Extending /(•) periodically over the real line, it is well known that the 
solution h(t,x) can then be recovered as [see, e.g., Strauss (1992), page 61] 

(5.5) hit, x) = \ f 1 l(\x -z\< t)f(z) dz, 

Jo 

so that (5.5) is of the form (2.1) with g(u,x) = 0.5I(|x| < u) (a boxcar- 
like kernel for each fixed u), where u in (2.1) is replaced by t in (5.5). 
Applying the theory developed in Sections 2-4, with t and x replaced by u 
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and t, respectively, we obtain functional Fourier coefficients g m (-) satisfying 
9m{u) = sm(2ir mu) / (2ttiti) , and 

ri (m) = / \g m (u)\ 2 du 
(5.6) Ja 

a sin(47r?na) — sin(47rm6) 



47T 2 m 2 V 2 87T7B 

Observe that the integral in (5.6) is always positive, bounded from above 
by Cm~ 2 and from below by Cm~ 2 [(b — a) — (27rm) -1 ], so that v = 1 and 
a = in both (3.3) and (4.1). 

Hence, one can construct an adaptive block thresholding wavelet estima- 
tor of the form (2.11), with j and J given by (4.4), which achieves the 
following minimax upper bounds (in the L 2 -risk): 

( Cn~ 2s ^ 2s+z \\nnY\ if s > 3(1 /p - 1/2), 

sup E||/„-/f< /InnV'As'+i) 
/efi|, 9 (A) I V~n~J (in^r 1 , if s < 3(l/p - 1/2), 

over Besov balls Bp q (A) of radius A > with s > 1/p' , l<p<c» and 1 < 
g < oo, where gt = 3(2/p-l)+/(2a + 3) if a > 3(l/p- 1/2), = (l-p/?)+ if 
s = 3(l/p— 1/2) and g\ = if s < 3(l/p — 1/2). [The minimax lower bounds 
(in the L 2 -risk) have the same form with g\ = 0.] 

Example 5 (Estimation in the multichannel deconvolution problem). Con- 
sider the problem of recovering /(•) G L 2 (T) on the basis of observing the 
following noisy convolutions with known blurring functions <#(•) 

(5.7) Y l (dt) = f*g l (t)dt + ^W l (dt), teT,l = l,2,...,M. 

x/n 



Here, o~i are known positive constants and Wi(t) are independent standard 
Wiener processes. 

The problem of considering systems of convolution equations was first 
considered by Casey and Walnut (1994) in order to evade the ill-posedness 
of the standard deconvolution problem, and was adapted for statistical use 
(in the density deconvolution model) by Pensky and Zayed (2002). Wavelet 
solutions to the problem (5.7) were investigated by De Canditiis and Pensky 
(2004, 2006). 

Note that deconvolution is the common problem in many areas of signal 
and image processing which include, for instance, LIDAR (Light Detection 
and Ranging) remote sensing and reconstruction of blurred images. LIDAR 
is a laser device which emits pulses, reflections of which are gathered by a 
telescope aligned with the laser [see, e.g., Park, Dho and Kong (1997) and 
Harsdorf and Reuter (2000)]. The return signal is used to determine distance 
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and the position of the reflecting material. However, if the system response 
function of the LIDAR is longer than the time resolution interval, then the 
measured LIDAR signal is blurred and the effective accuracy of the LIDAR 
decreases. If M (M > 2) LIDAR devices are used to recover a signal, then we 
talk about a multichannel deconvolution problem. Note that a discretization 
of (5.7) (with ai = 1 for I = 1,2, ... , M) leads to the discrete setup (1.4). 

Adaptive term by term wavelet thresholding estimators for the model (5.7) 
were constructed in De Canditiis and Pensky (2006) for regular-smooth con- 
volutions [i.e., a = in (3.3) and (4.1)]. However, minimax lower and upper 
bounds were not obtained by these authors who concentrated instead on 
upper bounds (in the L^-risk, 1 < tt < oo) for the error, for a fixed response 
function. Moreover, the case of super-smooth convolutions [i.e., a > in 
(3.3) and (4.1)] and the case when M — > oo have not been treated in De 
Canditiis and Pensky (2006). 

Let us now discuss the regular-smooth convolution case treated in De Can- 
ditiis and Pensky (2006), that is, the case when (in our notation) \g m (ui)\ ~ 
Q\m\- Ul with < Q < oo, I = 1, 2, . . . ,M. If M is fixed, then 

C*M _1 m _2l/min < n(m) < C*m~ 2iw , 

where v m \ n = min{^i, v%, • • • , vm} and < C* < C\ < C* < oo, I = 1,2, ... , M. 
Hence, the minimax rates of convergence (in the L 2 -risk) are determined by 
i/ m i n only, meaning that one can just rely on the best possible channel and 
disregard all the others. However, the latter is no longer true if M — ► oo. In 
this case, the minimax rates of convergence (in the L 2 -risk) are determined 
by T\ (m) which may not be a function of ^ m i n only. 

Consider now the adaptive block thresholding wavelet estimator / n (-) 
defined by (2.11) for the model (5.7) 07 = 1 for I = 1, 2, . . . , M or its discrete 
counterpart (1.4). Then, for the L 2 -risk, under the assumption (3.3), the 
corresponding minimax lower bounds are given by Theorem 1, while, under 
the assumption (4.1), the corresponding minimax upper bounds are given 
by Theorem 2. Thus, the proposed functional deconvolution methodology 
significantly expands on the theoretical findings in De Canditiis and Pensky 
(2006). 

6. Discussion: the interplay between continuous and discrete models. 

The minimax convergence rates (in the L 2 -risk) in the discrete model depend 
on two aspects: the total number of observations n = NM and the behavior 
of Ti(m) defined in (3.2). In the continuous model, the values of ri(m) are 
fixed; however, in the discrete model they may depend on the choice of M 
and the selection of points u\, I = 1,2, ... , M. Let us now explore when and 
how this can happen. 

Assume that there exist points u*, u* £ [a, b], —00 < a < b < 00 (with a < 
b in the continuous model while a = b is possible in the discrete model), 



FUNCTIONAL DECONVOLUTION 



19 



such that u* = arg mm u g m (u) and u* = arg max n g m (u) . (Obviously, this is 
true if the functional Fourier coefficients g m (') are continuous functions on 
the compact interval [a, b].) In this case, we have T\(m) > L^\g m {u^)\ 2 and 
Ti(m) < L*\g m (u*)\ 2 , where = L* = b — a in the continuous model and 
L* = L* = 1 in the discrete model. Assume also that we can observe y(u,t) 
at the points ti* and u* . The following statement presents the case when the 
minimax convergence rates cannot be influenced by the choice of M and the 
selection of points ui, I = 1,2, ... , M. 

Proposition 1. Let there exist constants v\ el, v 2 £ R, ct\ > 0, a 2 > 
0, Pi > 0, 02 > 0, L\ > and L 2 > 0, independent of m, such that 

(6.1) | 5m K)| 2 >L 1 |m|-^ 1 exp(-a 1 |m| /31 ), v 1 >Qifa 1 = Q, 

(6.2) \g m (u*)\ 2 <L 2 |mr 2 ^exp(-a 2 |m| /32 ), i/ 2 >0i/a 2 = 0, 

where either a\a 2 > and Pi = (3 2 or at\ = a 2 = and v\ = v 2 . Then, the 
minimax convergence rates obtained in Theorems 1 and 2 in the discrete 
model are independent of the choice of M and the selection of points ui, 
1 = 1,2, ...,M, and, hence, coincide with the minimax convergence rates 
obtained in Theorems 1 and 2 in the continuous model. 

The validity of Proposition 1 follows trivially from the lower and upper 
bounds obtained in Theorems 1 and 2. Proposition 1 simply states that 
asymptotically (up to a constant factor) it makes absolutely no difference 
whether one samples (1.4) n times at one point, say, U\ or, say, \fri times 
at M = yfn points u\. In other words, asymptotically (up to a constant 
factor) each sample value y(ui,ti), I = 1,2, ... , M, i = 1, 2, . . . , N, gives the 
same amount of information and the minimax convergence rates are not 
sensitive to the choice of M and the selection of points ui, I = 1,2, ... , M. 
The constants in Theorem 2 will, of course, reflect the difference and will be 
the smallest if one samples (1.4) n times at u* . 

However, conditions (6.1)-(6.2) are not always true. Consider, for ex- 
ample, the case when g(u,x) = (2n)~ 1 I(|x| < u), that is, the case of a 
boxcar-type convolution for each u G [a, b], < a < b < oo. Then, g m (u) = 
sin(27rm«)/(27rmn) and \g m (u*)\ 2 = 0; indeed, for rational points u = l\jl 2 G 
[a, b], the functional Fourier coefficients g m {u) vanish for any integer m mul- 
tiple of l 2 . This is an example where a careful choice of ui, I = 1, 2, . . . , M, 
can make a difference. For example, in the multichannel boxcar decon- 
volution problem (see also Example 5), De Canditiis and Pensky (2006) 
showed that if M is finite, M > 2, one of the ui's is a "badly approximable" 
(BA) irrational number, and u\, u 2 , . . . , um is a BA irrational tuple, then 
Ai(j) < Cj2^ 2+l / M ^ [for the definitions of the BA irrational number and 
the BA irrational tuple, see, e.g., Schmidt (1980)]. This implies that, in this 
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case, (the degree of ill-posedness is) v = 1 + 1/(2M). [The case M = 1, corre- 
sponding to the standard boxcar deconvolution problem, was considered by 
Johnstone, Kerkyacharian, Picard and Raimondo (2004) who showed that 
v = 3/2 when u\ is a BA irrational number.] Furthermore, De Canditiis and 
Pensky (2006) obtained asymptotical upper bounds (in the L w , 1 < ir < oo) 
for the error, for a wavelet estimator, for a fixed response function. They 
also showed that these bounds depend on M and the larger the M, is the 
higher the asymptotical convergence rates will be. Hence, in the multichan- 
nel boxcar deconvolution problem, it is advantageous to take M — > oo and 
to choose tii, U2, ■ ■ ■ ? u-M to be a BA tuple. 

However, the theoretical results obtained in Theorems 1 and 2 cannot be 
blindly applied to accommodate the blurring scenario represented by the 
case of boxcar-type convolution for each fixed u, that is, the case when 
g(u,x) = (2n) _1 I(|x| < u), u G [a,b], < a < b < oo. A careful treatment 
of this problem is necessary, since it requires nontrivial results in number 
theory. This is currently under investigation by the authors and the results 
of the analysis will be published elsewhere. 

7. Proofs. In what follows, for simplicity, we use the notation g instead 
of <?(•), for any arbitrary function g(-). Also, ipjj. refer to the periodized 
Meyer wavelets defined in Section 2. 

7.1. Lower bounds. 

Proof of Theorem 1. The proof of the lower bounds falls into two 
parts. First, we consider the lower bounds obtained when the worst functions 
/ (i.e., the hardest functions to estimate) are represented by only one term 
in a wavelet expansion (sparse case), and then when the worst functions / 
are uniformly spread over the unit interval T (dense case). 

Sparse case. Consider the continuous model (1.3). Let the functions fjk 
be of the form fj k = Jjipjk and let /o = 0. Note that by (3.1), in order 
fjk £ Bp J A), we need jj < A2~i s ' . Set jj = c2~i s ' , where c is a positive 
constant such that c < A, and apply the following classical lemma on lower 
bounds: 

Lemma 3 [Hardle, Kerkyacharian, Picard and Tsybakov (1998), Lemma 
10.1]. Let V be a functional space, and let d(v) oe a distance on V. For 
f,g&V, denote by A n (f,g) the likelihood ratio A n (f,g) = dP (f)/dF ( 9 ) , 
where dP v (h) is the probability distribution of the process X n when h is true. 
Let V contains the functions fo, fi, ■ ■ ■ , fn such that: 

(a) d(f k ,f k ,)> 8> for k = 0,1,..., X, k^k', 
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(b) N > exp(A n ) for some X n > 0, 

(c) lnA n (/o,/fc) = u nk — v nk , where v nk are constants and u nk is a ran- 
dom variable such that there exists ttq > with Ff k (u nk > 0) > ttq, 

(d) sup k v nk < \ n . 

Then, for an arbitrary estimator f n , 

supF (f) (d(f n ,f)>5/2)>ir /2. 
fev n 

Let now V = {f jk :0<k< 2? - 1} so that N = 2K Choose d(f,g) = \\f-g\\, 
where || • || is the L 2 -norm on the unit interval T. Then, d(fj k , fjk'} = Ij = <5- 
Let v nk = \ n = j\u2 and u nk = lnA n (/ , fj k ) + j ln2. Now, to apply Lemma 
3, we need to show that for some ttq > 0, uniformly for all fj k , we have 

f fjk (u nk > 0) = P /jfe (lnA n (/ , f jk ) > -j In 2) > vr > 0. 

Since, by Chebyshev's inequality, 

Ef.. I lnA n (/ ,/ 7 -fc)| 
P /jfc (lnA n (/ , f jk ) > -jln2) > 1 - ^ . 

we need to find a uniform upper bound for E/. fe | In A n (/ , . 

Let W(u, t) and W(u, t) be Wiener sheets on U x T. Let z(n, i) = -^/n (g * 
fjk){u, t) + i), where i) = W(u, t) and i) = W(u,t) [i.e., W(u, i) 
and W(u,t) are the primitives of z(u,t) and z(u,t), resp.]. Then, assum- 
ing that fj<fijn(g * fj k ) 2 (u,t)dudt < oo, by the multiparameter Girsanov 
formula [see, e.g., Dozzi (1989), page 89], we get 

(7.1) -lnA„(/ ,/,- fc ) = v^/ ! (g*f jk )(u,t)dW(u,t) 

JT JU 



(g* fjk) 2 (u,t)dudt. 

'tJu 

E fjk \]nA n (f J jk )\<A n + B n , 



Hence, 



where 



3 

'TJU 



(ip jk *g)(u,t)dW(u,t) 



B n = O.bnjj J (ipjk* 9) 2 (u,t) dudt. 

Since, by Jensen's inequality, A n < \/2B n , we only need to construct an 
upper bound for B n . For this purpose, we denote the Fourier coefficients 
of tp(-) by ip m = (e m ,ip), and observe that in the case of Meyer wavelets, 
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iVVnjfcl < 2~ J / 2 [see, e.g., Johnstone, Kerkyacharian, Picard and Raimondo 
(2004), page 565]. Therefore, by properties of the Fourier transform, we get 



(7.2) B n = o(V% 7 J £ / 



g m (u)\ 2 du 



Let j = j n be such that 



/ 7 „\ -Bn + V2.B. ra 1 

(7 ' 3j jln2 " 2' 

Then, by applying Lemma 3 and Chebyshev's inequality, we obtain 

inf sup E||/ n -/|| 2 >infsupi 7 |P(ll/n-/||>7 J 72) 
f n feB*(A) f n fev 

(7-4) 

> hi^o- 

Thus, we just need to choose the smallest possible j = j n satisfying (7.3), 
to calculate 7j = c2~ 3S , and to plug it into (7.4). By direct calculations, we 
derive, under condition (3.3), that 

f | , N,2^ / C2^ 2v ~ l \ if a = 0, 

l_ J v IfeWI S j ^2^(2^.!) exp (_ a ( 27r/ 3)/3 2 ^), if a > 0, 

(7.5) 

so that (7.3) yields 2 jn = C(n/lnra) 1 /( 2s ' +2l/ ) if a = and 2> l = C(lnrt) 1 ^ if 
a > 0. Hence, (7.4) yields 

. - ,„2^ f C(lnn/n) 2s '/(2 S '+2^) if a = 0, 

(7.6 mf sup E /„-/ ><^), (.aV/fl ., ' 

/„ feB* LC(lnn) ^ s/p , if a>0. 



The proof in the discrete case is almost identical to that in the continuous 
case with the only difference that [compare with (7.1)] 

N M 

-lnA ri (/ ,/ jfc ) = 0.5^2^2{[y(ui,ti) - 7j(^jfc* g)] 2 (ui,ti) -y 2 {u u ti)} 
i=l 1=1 

where 

N M 

u jk = Ij^^ii'jk * g)(ui,ti)eu, 
i=i i=i 

N M 

v jk = 0.5jjJ2J2( t t ; jk*g) 2 (ui,ti). 
1=1 1=1 
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Note that, due to P(e H > 0) = P(e h < 0) = 0.5, we have P(u jk > 0) = 0.5. 
Also, by properties of the discrete Fourier transform, we get 

M 

meCj 1=1 

By replacing B n and B n + \JB n with Vjk in the proof for the continuous 
case, and using (3.3), we arrive at (7.6). 



Dense case. Consider the continuous model (1.3). Let r\ be the vector 
with components n k = ±1, k = 0, 1, . . . , 2? — 1, denote by S the set of all 
possible vectors 77, and let fj V = JjJ2k=o Vk^jk- Let also rf be the vector 
with components rf k = (— l) l<y%=k ^ n k for i, k = 0, 1, . . . , 2 J — 1. Note that by 
(3.1), in order f jv £ B^ q (A), we need lj < A2~^ s+l / 2 \ Set 7j = c^2^ s+1 l 2 \ 
where c* is a positive constant such that c* < A, and apply the following 
lemma on lower bounds: 



Lemma 4 [Wilier (2005), Lemma 2]. Let A n (f,g) be defined as in Lemma 3, 
and let rj and fj v be as described above. Suppose that, for some positive con- 
stants A and ttq, we have 

P fjri (-]nA n (f jv iJ jv )<X)>7r , 

uniformly for all fj v and all i = 0, . . . , 2 3 — 1. Then, for any arbitrary esti- 
mator f n and for some positive constant C , 

maxE/.J/,, - f jv \\ > Cvr e- A 2^ 2 7 , 

Hence, similarly to the sparse case, to obtain the lower bounds it is suffi- 
cient to show that 

E fj,M nA n(fj^,fjv)\ < A l) 

for a sufficiently small positive constant Ax- Then, by the multiparameter 
Girsanov formula [see, e.g., Dozzi (1989), page 89], we get 

lnA n (/ ir? i , f jrj ) = y/n (g* (f jr)i - f jv ))(u, t) dW(u, t) 

J T J ZJ 



{g*(fjr,i - fjri)) 2 {u,t)dudt, 
IT JU 



and recall that |/,-„i — fj v )\ = 21^1. Then, 

E fjv \ \nA n (f jr)i J jv )\ <A n + B n , 
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where 



A n = 2^jE / / {iP ji *g)(u,t)dW(u,t] 
JtJu 

B n = 2n^ 2 J J (ipji * g) 2 (u,t) dudt. 



Hence, similarly to the sparse case, A n < \/2B n and (7.2) is valid. According 
to Lemma 4, we choose j = j n that satisfies the condition B n + \J2B Tl < Ai- 
Using (7.5), we derive that 2^ = C n l/(2*+2W-i) if a = and 2^ = C(lnn) 1 /' 3 
if a > 0. Therefore, Lemma 4 and Jensen's inequality yield 

(7.7 mf sup E /«-/ > . ,_ 2s /fl ., ' 

/» /6BS„ lC(lnn) ^/P, if q>0. 



The proof can be now extended to the discrete case in exactly the same 
manner as in the sparse case. Now, to complete the proof one just needs to 
note that s* = min(s,s'), and that 

(7.8) 2s/(2s + 2z/ + l) <2s* /{2s* + 2u) if u(2-p) <ps*, 

with the equalities taken place simultaneously, and then to choose the high- 
est of the lower bounds (7.6) and (7.7). This completes the proof of Theorem 
1. □ 

7.2. Upper bounds. 

Proof of Lemma 1. In what follows, we shall only construct the proof 
for bjk [i.e., the proof of (4.8)] since the proof for aj k [i.e., the proof of 
(4.7)] is very similar. First, consider the continuous model (1.3). Note that, 
by (2.9), 

where 

(7.9) f m - f m = n~ 1/2 (^J g m (u)z m (u)duj^J \g m (u)\ 2 du 

due to (2.2) and (2.4). Recall that z m (u) are Gaussian processes with zero 
mean and covariance function satisfying (2.3). Hence, it is easy to check that 



E[(/mi -/mi)(/m 2 ~ fm 2 )] = n ViO 7 ^)] 1 d~(m 1 -m 2 ), 
implying that 

E\b jk -b jk \ 2 = n~ l ^2 \^mjk\ 2 [n{rn)}~ 1 , 
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where T\(m) is defined in (3.2) (the continuous case). To complete the proof 
of (4.8) in the case of k = 1, just recall that \Cj\ = 47r2- 7 and \ip m jk\ 2 < 2~ 3 . 
If k = 2, then 

n 2\ 



E|6 ifc -6 jfc | 4 = E E|/ m -/ m | 4 )+0\ 



E 



LmeC, 



O n 2 E l^mjA:| 4 T2(m)[Ti(m)]" 



m£Ci 



+ 0\n 



= 0(n- 2 2~J A 2 (j)) + 0(n~ 2 A 2 (j)) = 0(n~ 2 A 2 (i)), 

since, by the Cauchy-Schwarz inequality, A 2 (j) < A 2 (j). This completes the 
proof of (4.8) in the continuous case. 

In the discrete case, formula (7.9) takes the form [see (2.6)] 



(7.10) 



fm ~ fm = N~ 1 ' 2 (E 9m{ui) Z m ^j j (^T 



9m(ui)\' 



where z m \ are standard Gaussian random variables, independent for different 
m and I. Therefore, similarly to the continuous case, 



E\b jh -b jh \ 2 = N- 1 E Wmjk\ 2 



m£Cj 



M 



E \9m(ui) 



1=1 



In the case of k = 2, note that 

nbjk - M 4 = 0(2-iN- 2 M- 3 A 2 (j) + N~ 2 M~ 2 Af(j)) = 0(n~ 2 A 2 (j)), 

by applying again the Cauchy-Schwarz inequality. This completes the proof 
of (4.8) in the discrete case. 

The last part of the lemma follows easily from (4.2) with k = 2, using the 
assumption (3.3) and the Cauchy-Schwarz inequality, thus completing the 
proof of Lemma 1. □ 

PROOF of Lemma 2. Consider the set of vectors 



1jr= <v k ,keUj r : E M 2 <lf, 



and the centered Gaussian process defined by 

z jr(v)= E v k(bjk-b jk ) 



k&Ji, 



The proof of the lemma is based on the following inequality: 



26 



M. PENSKY AND T. SAPATINAS 



Lemma 5 [Cirelson, Ibragimov and Sudakov (1976)]. Let D be a subset 
o/R = (—00, 00), and let (£t)teD be a centered Gaussian process. If E,(sup teD £t) < 
B\ and sup i6 £> Var(^) < B2, then, for all x > 0, we have 

(7.11) P(sup& > x + Bj) < exp(-x 2 /(2B 2 )). 

\teD / 



To apply Lemma 5, we need to find -Bi and B 2 . Note that, by Jensen's 
inequality, we get 

1/2 

\ b jk-b jk \ 2 



E 


sup Zj r (v) 


= E 









.feet/,-. 



< 



]T E\b jk -b jk \ 



keu T 



1/2 



n. 



[Here, c\ is the same positive constant as in (4.3) with a = 0.] Also, by (2.3) 
and (7.9) or (7.10), we have 

H(bjk -b jk )(bjk' ~ b jk ')] =n~ l ^ '<Pmjk'ipmjk'[n(rn)]~ 1 , 



where 7i(m) is defined in (3.2). Hence, 



sup Vax(Z jr (v)) = n 1 sup ^ ^ v k v k > ^ ^mjkipmjk'[Ti{m)] 1 

kelljr 



by using £ meCj . ^mjk^mjk' = I(& = and (4.3) for a = 0. Therefore, by 
applying Lemma 5 with i?i = Jcin~ 1 ' 2 2^v / hw : -B2 = c\n~ 1 2 2v i and x = 
(0.5// - v ^)n~ 1 / 2 2^ v / lnn, we get 



'( S l&?fc-^fc| 2 >0.25/i 2 n _1 2 2i/J 'lnn) 
\keu ir 1 



\bjk-b jk \ 2 



jteu*. 



1/2 



> ^/5Tn" 1/2 2^v / hi^ + : 



< exp(-(2ci)- 1 (0.5^ - y/b~[) 2 Inn) < n" 9 , 

where 9 = (8u - Au x + 2)/{2v + 1), provided that fi > 2y/c[(l + y/29). This 
completes the proof of Lemma 2. □ 
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Proof of Theorem 2. First, note that in the case of a > 0, we have 

E\\f n -f\\ 2 = R 1 +R 2 , 

where 

oo 2*-l 2*0-1 

(7.12) i?i = £ E &? fc , i?2= E E (%ofc " %ofe) 2 ; 

j=J fc=0 fc=0 

since jo = J. It is well known [see, e.g., Johnstone (2002), Lemma 19.1] that 
if / € BpJA), then for some positive constant c*, dependent on p, g, s and 
A only, we have 

(7-13) 2 ^b%<c*2-^ s \ 

k=0 

thus, i?i = 0{2- 2Js *) = 0((lnn)- 2s *// ? ). Also, using (4.3) and (4.7), we de- 
rive 

R 2 = OCn" 1 ^ Ai(io)) = O^-^Onra) 2 ^) = o((lnn)- 2s */' 3 ), 

thus completing the proof for a > 0. 

Now, consider the case of a = 0. Due to the orthonormality of the wavelet 
basis, we get 

(7.14) E\\f n - f\\ 2 = R 1 +R 2 + R3 + R±, 
where R\ and R 2 are defined in (7.12), and 

J-i 

^3 = E E E E [& - b jk ?l(B jr > dn~ l 2 2 ^ Inn)], 
3=30 reAj k&Uj r 

J-1 

i? 4 =E E E nb 2 jk l{B jr <drr x 2 2v hnn)}, 

3=30 r&Aj kelfjr 

where Bj r and d are given by (2.10) and (4.6), respectively. 

Let us now examine each term in (7.14) separately. Similarly to the case of 
a > 0, we obtain R 1 = 0(2~ 2Js *) = 0(n~ 2s */(2iH-i)^ By direct calculations, 
one can check that 2s* /(2v + 1) > 2s/(2s + 2v + 1), if v{2 — p) < ps*, and 
2s* /{2v + 1) > 2s*/(2s* + 2v), if v{2 -p)> ps* . Hence, 

(7.15) R 1 = 0{n~ 2s l { - 2s+2v+ V) ifu(2-p)<ps*, 

(7.16) R x = o(n- 2s *tt 2s *+ 2 ^) if i/(2 - p) > ps* . 
Also, by (4.7) and (4.3), we get 

R 2 = 0{n~H 2v+1 ^) = 0(n _1 (In n) 2 " +1 ) 

(7.17) 

= ( n - 2s /( 2s+2l/+1 )) = o(rT 2s * /(2s * +2i ' ) ) 
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To construct the upper bounds for R3 and i?4, note that simple algebra 
gets 



(7.18) 
where 



R3 < (-R31 + R32), R4 < (-R41 + -R42), 



(b jk ~ b jk fl E ~ b jk \ 2 > OSSdn-^ In 



%=EE E e 

i=io r-GAj feeC7 3 v 

j-i 

^32 = E E E E K^fc - b jk ) 2 I(B jr > O^dn-^i Inn)], 

i=io r-GAj feef/jr 



11 



h %A E Ibifc-bifcP^O^dn-^^'ln 

\keu ir 



n 



^i = EE E E 

J-i 

i?42=E E E E[$feI(B ir <2.5dn -1 2^"kin)], 

since 6 2 fc < 2(bjk — bjk) 2 + 2b 2 - k . Then, by (7.13), Lemmas 1 and 2, and the 
Cauchy-Schwarz inequality, we derive 

^31 + ^41= EE E E U(b jk -b jk ) 2 + b%) 

3=30 re Aj keUj r \ 



l( E fijk - bjk\ 2 > 0.25dn2 2uj Inn] j 



'J-l 



E E E (V E (^-M 4 +^) 

\?=J0 ^GAj fcGf/jr 



1 



E fo'fe - 6 Jfcl 2 ^ 0.25(i?i2 2 ^' Inn) J 



' J-i 



O E [2%- 1 2 2 ( 2l/ -" 1 ^ + 2- 2js *]n 



= 0^), 

provided d > ci(l + v 7 ^) 2 , where = {Su - \v x + 2)/(2i/ + 1) and ci is the 
same positive constant as in (4.3) with a = 0. Hence, 

(7.19) Ai=i2 3 i + H4 1 =0(w -1 )- 
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Now, consider 
(7.20) 

Let ji be such that 
(7.21) 



A 2 — R32 + -R42 • 



231 



l/(2s+2i/+l) 



(Inn 



,ei/(2f+i) 



n 

where £1 is defined in (4.11). 

First, let us study the dense case, that is, when v{2 — p) < ps* . Then, 
A2 can be partitioned as A2 = A21 + A22, where the first component is 
calculated over the set of indices jo < j < ji and the second component over 
ji + 1 < j < J— 1. Hence, using (2.10) and Lemma 1, and taking into account 
that the cardinality of Aj is = 2-?/mn, we obtain 



n 



A 



21 



E 



\7=J0 



2^-1 



(7.22) 



E - 

.fe=0 

+ E B jr I(B jr < 2Mn~ 1 2 2vj \n 



n] 



n 



°[ E 



\7=J0 L 



n 



-i 2 (2H-i)i + £ n -l2 2 ^'ln 



n 



= G^n"^ 2 ^ 1 ^) = 0(n- 2s /( 2s+2 ^ +1 )(lnn)^). 
To obtain an expression for A22, note that, by (7.13), and for p > 2, we have 



J-i 



A 22 = O E E \ n ~ l22uj fonI(B jr > 0.25dn- 1 2 2uj Inn) + B jr ] 



j-i 



(7.23) =0 E E^> =0 E 2- 2js 
V^'i+lreAj / \j=ii+l / 

= 0(n _2s/(2s+2iy+1) ). 

If 1 < p < 2, then 



/ \p/2 

<= Eft - E 

\k£Ujr ' k£Uj r 



so that by Lemma 1, and since i/(2 — p) <ps*, we obtain 



J-l 



A 22 = O E E [ n_1 2 2 ^ InnI(Bj r > O^dn"^ 2 ^' In 

\j'=ji+l r-eAj 



n 
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(7.24) + B jr I(B jr < 2.5dn~ 1 2 2uj Inn)] J 

= ( E E [(« _1 2 2 " j lnn) 1 ^/ 2 ^/ 2 

Vj=ii+lreAj 

= o( ]T (n^nnnf^Y. E w) 
= °f E (""^^'lnn) 1 -^^-^ 5 ^ 

Vj=ji+i / 

= °f E (n 4 ]nn) 1 " p/2 2 (2 "" ! '^ ps,b '| 
Vj=ji+i / 

= 0((n~ 1 lnn) 1 ~ p / 2 2^ u ~ pu ~ ps ^ jl ) 
= ( n -V(2*+2H-l)( ]nn)<? i). 

Let us now study the sparse case, that is, when z^(2 — p) > ps* . Let ji 
be defined by (7.21) with q x = 0. Hence, if B jr > ^.2hdn~ x 2 2v ^ Inn, then 

Ylk=Q^ 2 jk — 0-25dn~ 1 2 ( - 2u+1 ^ , implying that j cannot exceed j% such that 
2-? 2 = (Ac*n/(d\nn)) l /( 2s * +2u \ where c* is the same constant as in (7.13). 
Again, partition A2 = A21 + A22, where the first component is calculated 
over jo < j < 22 and the second component over ji + 1 < j < J — 1. Then, 
using arguments similar to those in (7.24), and taking into account that 
v(2 — p) > ps*, we derive 

A 21 = o(f>- 1 2 2 ^lnn) 1 ^/ 2 £ £ \b jk \ p ) 
\j=jo 

h 



0\ ^(n- 1 2 2 ^lnn) 1 - p / 2 2-^ s * 



\?=.?0 
32 



(7.25) = 0\ Y.( n ~ llnn ) 1 ~ P/22i2U ~ PU ~ PS * )j 



V=30 



0((n- 1 lnn) 1 - p/2 2^ 2u - pu - ps ^ j2 ) 
0((lnn/n) 2s */( 2s * +2 ^). 
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To obtain an upper bound for A22, recall (7.20) and keep in mind that the 
portion of R32 corresponding to 32 + 1 < J ' < J ~ 1 is just zero. Hence, by 
(7.13), we get 



J-l 2 3 -l 



J-l 



(7.26) 



\j=j2+i k=o / M=i 2 +i 



■2js* 



0((lnn/n) 2s */( 2s *+ 2 ^). 



Now, in order to complete the proof, we just need to study the case when 
v(2 -p) =ps*. In this situation, we have 2s/{2s + 2v + 1) = 2s* /(2s* + 2u) = 
1 —p/2 and 2vj(l — p/2) =pjs*. Recalling (3.1) and noting that s* < s', we 
get 



J-i 



23-1 



q/p 



E 2P]S * E I 6 



3=30 



k=0 



Then, we repeat the calculations in (7.25) for all indices jo < j < J — 1. If 
1 <p<q, then, by Holder's inequality, we get 



'J-i 



2-7-1 



A 2 = ]T (In n/n) 1 -^** £ |6 jfc | 
\i=io fc=o 



(7.27) = O (lnn/n) 1 ~ p / 2 (lnn) 1 - p / q 



J-l 



2-7-1 



£ 2^ s * e 



■J=J0 



fc=0 



P/<?N 



= 0((lnn/n) 2s */( 2s * +2 ^(lnn) 1 - p /''). 
If 1 < (7 < p, then, by the inclusion Bp J A) C (A), we get 



'J-i 



2^-1 



A 2 = ^(lnn/n) 1 -^^* £ \b jk \* 



\7=jo 



(7.28) 



O (lnn/n) 1 "^ 2 



J-i 



k=0 
21-1 



y: e w 

Lj=j k=0 

0((lnn/n) 2s */(2 S *+2,) ) . 



By combining (7.15)-(7.17), (7.19), (7.22)-(7.28), we complete the proof of 
Theorem 2. □ 
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